AITopics | reduction layer

Collaborating Authors

reduction layer

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A.1 EquivalencebetweenPreConvSAandvanillaSA OurproposedPreConvSAisformulatedinEquation1. f0i =MLPs fli fl+1

Neural Information Processing SystemsFeb-11-2026, 18:37:38 GMT

Inspired by the depthwise separable convolution used in MobileNet [1], we extend this separable idea to graph convolution. We only perform MLPs on point features directly to learn thechannelcorrelation, andleverages theanisotropic reduction toaggregatethespatial correlation.

artificial intelligence, assa module, convolution, (11 more...)

Neural Information Processing Systems

Country: Oceania > Australia > New South Wales > Sydney (0.05)

Technology: Information Technology > Artificial Intelligence (0.32)

Add feedback

Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

Prabhavalkar, Rohit, Meng, Zhong, Wang, Weiran, Stooke, Adam, Cai, Xingyu, He, Yanzhang, Narayanan, Arun, Hwang, Dongseong, Sainath, Tara N., Moreno, Pedro J.

arXiv.org Artificial IntelligenceFeb-26-2024

The accuracy of end-to-end (E2E) automatic speech recognition (ASR) models continues to improve as they are scaled to larger sizes, with some now reaching billions of parameters. Widespread deployment and adoption of these models, however, requires computationally efficient strategies for decoding. In the present work, we study one such strategy: applying multiple frame reduction layers in the encoder to compress encoder outputs into a small number of output frames. While similar techniques have been investigated in previous work, we achieve dramatically more reduction than has previously been demonstrated through the use of multiple funnel reduction layers. Through ablations, we study the impact of various architectural choices in the encoder to identify the most effective strategies. We demonstrate that we can generate one encoder output frame for every 2.56 sec of input speech, without significantly affecting word error rate on a large-scale voice search task, while improving encoder and decoder latencies by 48% and 92% respectively, relative to a strong but computationally expensive baseline.

latency, proc, reduction, (14 more...)

arXiv.org Artificial Intelligence

2402.17184

Country: North America > United States (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Add feedback

FedEgo: Privacy-preserving Personalized Federated Graph Learning with Ego-graphs

Zhang, Taolin, Chen, Chuan, Chang, Yaomin, Shu, Lin, Zheng, Zibin

arXiv.org Artificial IntelligenceSep-9-2022

As special information carriers containing both structure and feature information, graphs are widely used in graph mining, e.g., Graph Neural Networks (GNNs). However, in some practical scenarios, graph data are stored separately in multiple distributed parties, which may not be directly shared due to conflicts of interest. Hence, federated graph neural networks are proposed to address such data silo problems while preserving the privacy of each party (or client). Nevertheless, different graph data distributions among various parties, which is known as the statistical heterogeneity, may degrade the performance of naive federated learning algorithms like FedAvg. In this paper, we propose FedEgo, a federated graph learning framework based on ego-graphs to tackle the challenges above, where each client will train their local models while also contributing to the training of a global model. FedEgo applies GraphSAGE over ego-graphs to make full use of the structure information and utilizes Mixup for privacy concerns. To deal with the statistical heterogeneity, we integrate personalization into learning and propose an adaptive mixing coefficient strategy that enables clients to achieve their optimal personalization. Extensive experimental results and in-depth analysis demonstrate the effectiveness of FedEgo.

fedego, personalization layer, reduction layer, (9 more...)

arXiv.org Artificial Intelligence

2208.13685

Country:

North America > United States > Virginia (0.04)
Asia > China > Guangdong Province > Guangzhou (0.04)

Genre: Research Report (0.64)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.55)

Add feedback

A Proper Orthogonal Decomposition approach for parameters reduction of Single Shot Detector networks

Meneghetti, Laura, Demo, Nicola, Rozza, Gianluigi

arXiv.org Artificial IntelligenceJul-27-2022

As a major breakthrough in artificial intelligence and deep learning, Convolutional Neural Networks have achieved an impressive success in solving many problems in several fields including computer vision and image processing. Real-time performance, robustness of algorithms and fast training processes remain open problems in these contexts. In addition object recognition and detection are challenging tasks for resource-constrained embedded systems, commonly used in the industrial sector. To overcome these issues, we propose a dimensionality reduction framework based on Proper Orthogonal Decomposition, a classical model order reduction technique, in order to gain a reduction in the number of hyperparameters of the net. We have applied such framework to SSD300 architecture using PASCAL VOC dataset, demonstrating a reduction of the network dimension and a remarkable speedup in the fine-tuning of the network in a transfer learning context.

artificial intelligence, detection, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2207.13551

Country: Europe > Italy > Friuli Venezia Giulia > Trieste Province > Trieste (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

Add feedback